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Abstract — This paper introduces a new approach to solve 
sensor management problems. Classically sensor management 
problems can be well formalized as Partially-Observed Markov 
Decision Processes (POMPD). The original approach developped 
here consists in deriving the optimal parameterized policy based 
on stochastic gradient estimation. We assume in this work that 
it is possible to learn the optimal policy off-line (in simulation 
) using models of the environement and of the sensor(s). The 
learned policy can then be used to manage the sensor(s). In order 
to approximate the gradient in a stochastic context, we introduce 
a new method to approximate the gradient, based on Infinitesimal 
Approximation (IPA). The effectiveness of this general framework 
is illustrated by the managing of an Electronically Scanned Array 
Radar. 

Keywords: Sensor(s) Management, Partially Observable 
Markov Decision Process, Stochastic Gradient Estimation, 
AESA Radar. 
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I. Introduction 

Years after years the complexity and the performances 
of many sensors have increased leading to more and more 
complex sensor(s)-based systems which supply the decision 
centers with an increasing amount of data. The number, the 
types and the agility of sensors along with the increased quality 
of data far outstrip the ability of a human to manage them: it is 
often difficult to compare how much information can be gained 
by way of a given management scheme [1]. It results from this 
the necessity to derive unmanned sensing platforms that have 
the capacity to adapt to their environment [2]. This problem 
is often refered as the Sensor(s) Management Problem. In 
more simple situations, the operational context may lead to 
works on sensor(s) management like in the radar - infrared 
sensor case [3]. A general definition of this problem could 
then be : sensor management is the effective use of available 
sensing and database capabilities to meet the mission goals. 
Many applications deal with military applications, a classical 
one being to detect, to track and tp identify smart targets 
(a smart target can change its way of moving or its way 
of sensing when it detects it is under analysis) with several 



sensors. The questions are then the following at each time: 
how must we group the sensors, how long, in which direction, 
and with which functioning mode? The increasing complexity 
of the targets to be detected, tracked and identified, makes the 
management even more difficult and led to the development of 
researches on the definition of an optimal sensor management 
scheme in which the targets and the sensors are treated 
altogether in a complex dynamic system [4]. 

Sensor Management has become very popular this last years 
and many approaches can be found in the litterature. In [5] and 
[6] the authors use a the modelling of the detection process 
of an Electronically Scanned Array (ESA) Radar to propose 
management scheme during the detection step. In [7]-[9] an 
information-based approach is use to manage a set of sensors. 
From a theorical point of view the sensor management can be 
modelled as a Partially Observable Markov Decision Process 
(POMDP) [10]— [12]. Whatever the underlying application, the 
sensor management problem consists in choosing at each time 
t an action A t within the set A of available actions. The 
choice of A t is generally based on the density state vector 
X t describing the environment of the system and variables 
of the system itself. It is generally assumed that the state 
or at least a part of this state is Markovian. Moreover in 
most of the applications, we only have access to a partial 
information of the state and X t must be estimated from the 
measurements {Y s }i< s <t. This estimation process is often 
derived within a Bayesian framework where we use state- 
dynamics and observation models such as: 

X t+l = F{X u A u N t ) (1) 

Y t =H(X t ,W t ) (2) 

where N t , Wt, F and H respectively stands for the state 
noise, the measurements noise, the state-dynamics and the 
measurement function. F and H are generally time varying 
functions. The control problem consists in finding the schedul- 
ing policy 7r i.e. select A t given the past and the possible 



futures. However, this control problem may have a theorical 
solution, it is generally untractable in practice. However few 
works propose optimal solution in the frame of POMDPs 
like [12]. Beside, several works have been carried out to 
find sub-optimal policies like for instance myopic policies. 
Reinforcement Learning and Q-Learning have also been used 
to propose a solution ( [13], [14]). 

We propose in this paper to look for a policy within a class 
of parametrized policy ng and to learn it which means learn 
the optimal value of 9. Funding our work on the approach 
described in [15] we assume that it is possible to learn this 
policy in simulation using models of the overall system. Once 
the optimal parameter has been found it is used to manage 
the sensor(s). The frame of this work being the detection and 
localization of targets, we show in the last part of this paper 
how it may be applied the the management of an ESA radar. 

The section [II] described the modelling of a sensor man- 
agement problem using a POMDP approach. In the section 
ITU we derive the algorithm to learn the parameter of the 
policy.In section [IV] we show how this method may be used 
for the tasking of an ESA radar. Finally section [V] exhibits 
firts simulations results. 

II. Modelling 

A. POMDP Modelling 

Let us consider three measurable continuous spaces denoted 
by X, A and y, X is called the state space, y the observation 
space and A the action space. We call Ai(X) the set of 
all the measures defined on X. A Partially-Observable De- 
cision Proceess is defined by a state process (X t )t>o € X, 
an observation process {Y t )t>i £ y and a set of actions 
(At)t>i € A. In these definitions t stands usually for the 
time. The state process is an homogeneous Markov chain with 
initial probability measure fi(dxo) € M(X) and with Markov 
transition kernel K(dx t+ x\x t ) ( [16]): 

Vt>0,X t+ i~K(-\X t ) (3) 

X Q ~ H (4) 

(Yt)t>i is called the observation is linked with the state 
process by the conditional probability measure: 

V(Y t €dy t \X t = x t )=g(x u yt)dyt (5) 

where g : X x y — > [0, 1] is the marginal density function 
of Y t given X t . In a general way, the state process evolves 
continuously with respect to time t whereas the observations 
are made at sampled time t n . A new observation is used 
to derive a new action. We will therefore consider in the 
following the processes (X t )t>o, (Y n )n&i> (A„)„ eN where 
n stands for the index of the observation. We also assume 
that there exists two generative functions F^ : U — > X and 
F : X X U — * X, where (U,a(U), v) is a probability space, 
such that for any measurable test function f defined over X 
we have: 



J f{x t )K{dxt\xt-x) = J f{F{xt-x,u))v{du) (6) 
and 

J f(x )n(dx ) = J f{F^u))v{du). (7) 

In many practical situations, U = [0, l] nu , and u is a njj- 
uple of pseudo random numbers generated by a computer. 
For sake of simplicity, we adopt the notations K(dxo\x-x) = 
fi(dxo) and F(x-i,u) = F^u). Under this framework, the 
Markov Chain (X t )t>o is fully specified by the following 
dynamical equation: 

X t+1 = F(X t ,U t ), U t iA ~ d -v (8) 

The observation process (Y n ) n ^n, defined on the measur- 
able space (y,a(Y)), is actually linked with the state process 
by the following conditional probability measure 

P(¥« € dy n \X tn = x tn ,An) = g(y n ,x tn ,A n )X(dy t ) (9) 

where A n € A is defined on the measurable space 
(A, cr{A)) and A is a fixed probability measure on (y, a(Y)). 
As we assume that observations are conditionally independent 
given the state process we cab write VI < i,j < t, i ^ j: 

P(Fi € dyi,Yi € dyjlXo^A^Aj) = 
P(r, e dyilXo^AJFiYj e dyj\X 0:t ,Aj) (10) 

where we have adopted the usual notation Zi-.j = (zk)i<k<j- 

B. Filtering distribution in a Partially-Observable Markov 
Decision Process 

Given a sequence of action A\- n and a sample trajectory of 
the observation process yx- n and indices {nx, ?i2> tx> £2} such 
that 1 < nx < n,2 < n and < tx < t ni < t n2 < t% < t n , 
we define, using the , the posterior probability distribution 

M ti:t 

2 |ni:n 2 

(dx tl[t2 ) by ( [17]): 

^(Xt 1 :t 2 € dXt 1 :t 2 \Yn 1 :n2 = J/rM:«2 ) AliMj ) (ID 

Using the Feynman-Kac framework, the probabilty QT| can 
be written: 

f X H-H Utu K(dx t \x t _x)U]l ni GtM 3 Y 

where for simplicity's sake, G tn {x tn ) = g(y n , x trl , A n ) and 
Go (2^0) = 0. One of the main interest here is to estimate the 
state at time t from noisy observations yx-.n t with fh the index 
of the last observation just before time t. From a bayesian 
point of view this information is completely contained in the 
so-called filtering distribution M t:t | 1: „ t . In the following, the 
filtering distribution will simply be denoted as M t . 



C. Numerical methods for estimating the filtering distribution 
Given a measurable test function / : X — * K, we want to 
evaluate 

M t (f) = E[/(X t )|F 1:nt - y l:nt ,A 1:nt ] (13) 
which is equal, using the Feynman Kac framework, to: 



(14) 



In general, it is impossible to find M t (f) exactly except for 
simple cases such as linear/gaussian (using Kalman filter) or 
for finite state space Hidden Markov Models. In the general 
dynamics, continuous space case considered here, possible 
numerical methods for computing M t (/) include the Extended 
Kalman filter, quantization methods, Markov Chain Monte 
Carlo methods and Sequential Monte Carlo methods (SMC), 
also called particle filtering. The basic SMC method, called 
Bootstrap Filter, approximates M t (/) by an empirical distribu- 
tion M t N ( f) = ± YliLi f( x i) made of N so-called particles 
( [18]). It consists in a sequence of transition/selection steps: 
at time t, given observation y t ( [15]): 

. Transition step: (also called importance sampling or 
mutation) a successor particles population x\ :N is gen- 
erated according to the state dynamics from the previous 
population x]'^. The (importance sampling) weights 



w 



1:N 



90 



,Vt) 



are evaluated. 



* Ef=ifl(2?.»t) 

Selection step: Resample (with replacement) N particles 



x\ from the set xj: according to the weights w 



1:N 

t 



We write x\ :N = x t * where kj :N are the selection 
indices. 

Resampling is used to avoid the problem of degeneracy of 
the algorithm, i.e. that most of the weights decreases to zero. It 
consists in selecting new particle positions such as to preserve 
a consistency property : 



JV 

E 



i 



N 



E[^2^(*J)] 

i=l 



(15) 



The simplest version introduced in [19] consists in choosing 
the selection indices k} :N by an independent sampling from 
the set 1 : N according to a multinomial distribution with 



parameters w\ :N , i.e. 



t = j) = w 3 t , for all 1 < i < N. 



The idea is to replicate the particles in proportion to their 
weights. The reader can find some convergence results of 
Mf(f) to M t (f) (e.g. Law of Large Numbers or Central 
Limit Theorems) in [17], but for our purpose we note that 
under weak conditions on the test function and on the HMM 
dynamics, we have the asymptotic consistency property in 
probability, i.e. limbec M t N (f) I M t (f). 

III. Policy Learning Algorithm 

A. Optimal Parameterized Policy for Partially-Observable 
Markov Decision Process 



Let R t be a real value reward function 

R t ±R(X t ,M t (f)) . 



(16) 



The goal is to find a policy 

7T : A n x y n -> A 

that maximizes the criterion performance : 



J n = [ E[Rt]dt 
Jo 



(17) 



(18) 



where T is the duration of the scenario. Designing in 
practice policies that depend on the whole trajectory of the past 
observations/actions is unrealistic. It has been proved that the 
class of stationary policies that depend on the filtering distri- 
bution conditionally to past observations/actions M t contains 
the optimal policy. In general the filtering distribution is an 
infinite dimensional object, and it cannot be represented in a 
computer and so is the policy. We therefore propose to look 
for the optimal policy in a class of parameterized policies 
(^a)aer that depend on a statistic of the filtering distribution 



A n+1 =7T„(M tB (/)) 



(19) 



where / is any test function. As the policy ir is parameter- 
ized by a, the performance criterion now depends only on a. 
Thus we can maximize it by achieving a stochastic gradient 
ascent with respect to a : 



a k+1 = a k + r? fe VJ Qfc , k > 



(20) 



where VJ ak denotes the gradient of J ak w.r.t a^- By 
convention V J ak is column vector whose i-th component is 
the partial derivative with respect to on. {rjk)k>o is a non- 
increasing positive sequence tending to zero. We present in 
the two following subsection a possible approach to estimate 
V J ak based on Infinitesimal Perturbation Analysis (IPA). 

B. Infinitesimal Perturbation Analysis for gradient estimation 
We assume that we can write the following equality at each 

k: 

VJ a = f \7 a E[R t ]dt (21) 
Jo 

Proposition 1: We have the following decomposition of 
the gradient 

V a E[R t ] = E[M t (fS t )V MtU) R t ] 

- E[M t (f)M t (S t )V Mt (f)Rt] 

+ E[R t S t ] (22) 



where 



St 



Vt 



V a G t] (X tj ) 



(23) 



Proof: First let us rewrite V Q E[i? t ] as following: 

V a E[R t ] = V Q / R t U t V t TT \{dy 3 ) (24) 

Jx*xy n t 

where: 



Ut =n\ =0 K(dxi\xi-i) , 
V t =U]UG tj (x tj ) 



(25) 



Remarking that only R t and Vt depends on a so that we 
obtain 



v a v t = s t v t , 

V a R t =V a M t (f)V Mt (f)Rt 

where St is given by eq.d23l>. Incorporating d24l > in 
obtain 

V E[Jfc] = E[V a M t (/)V Mt(/)j R t ] . + E[R t S t ] 
Now using one more time 1261 . we have 



(26) 



we 



(27) 



V«Af t (/) 



V„E 



E 



f{X t ) 



= E 



f(Xt)S t 



E[V t ] 



f(X t 



y t E[v a Vt] 



MtStM 



E[V t 

M t (fS t )-M t (f)M t (S t 



E[V t 
V t 



E[V t ] 



(28) 



so that we obtain ( 122b by incorporating d28l l in d27l ). 



We can deduce directly Algorithm 1 from d22l) . It is impor- 
tant to note that we must deal with two time-scales. This first 
and the shorter one allows to simulate the continuous state X t . 
On the contrary the observation and action process are updated 
only each time we get a new observation. These specific time is 
denoted t n in the algorithm. That is the eason while there is an 
alternative to update the variables St and wl_ ± . A new action 
A n is also calculated each t n as already explained above. 
One can also be surprised to calculate R(X t , M t (f)) using 
the sampled value of X t . To well understand this algorithm 
we must remind that the learning is made off-line using a 
simulated process. It is therefore possible to use the real value 
of X t in this case. 

IV. Application to the ESA Radar 

The ESA is an agile beam radar which means that it is 
able to point its beam in any direction of the environnement 
almost instantaneously without inertia. However, the targets in 
the environement are detected w.r.t a probability of detection 
which depends on the direction of the beam and the time 
of observation in this direction. In the following, we precise 
first the nature of an action, then the influence of the action 
onto the probability of detection and finally the nature of the 
observations. 

Definition of the action: The main property of an ESA is 
that it can point its beam without mechanically adjusting the 
antenna. An ESA radar provides measurements in a direction 
9. We note S, the time of observation in this direction. In this 
work the the n-th action is : 



Algorithm 1 Policy Gradient in POMDP via IPA 
Initialize ao E T 
for k = 1 to oo do 
for t = 1 to T do 

Sample u t ~ v 

Set x t = F(x t -i,ut), 

If t = t n , sample y n ~ g(.,x t ,a n )X(.) 



Set s t = < St 1 T g{ x t,y n ,a n ) 



if t = t n 
else 



[ S t -1 

Set Vi € {1 . . . , /} 

xf 1 = F(x[ l }_ 1 , at-ijU^) where ^ v 

if t = t n 



S '-! + ,„„,«„) 
4-! 



else 



gjx^ ,y rl ,a rl )w < t '2 1 . f 



WfZ 1 else 

Set (xl l) ,sl = (x ( t l \s ( t l) )t & {k u ...M I }, kui 
are selection indices associated to (w^)i<={i n, 

m t (f) = ^EJ(4% Mst) = }"L. s w, 
m i (/ St ) = iE i /(4 l) )4 l) . 

a n +\ = 7r Qfc (m t ) if t = t„ 
r t = R(x t ,m t {f)) 

Vr t = (m t (fs t ) - m t (f)m t (st)) d ^f {f) {x t ,m t {f)) + 
r t s t 

V J Qfc = V J ak + Vr t 
end for 

"fc+l = &k + VkVJa k 

end for 



with 



ehf,fJ 



On e 



Vn > 



(30) 



This is a simple possible action. One could increase the 
number of componenets of an action by adding the emitted 
frequency for instance. The action does not influence directly 
the observation produced by the ESA but the probability of 
detection of a target. 

The probability of detection Pd'- It refers to the probabil- 
ity to detect a target and therefore to the probability to obtain 
an estimation of the state of a target p at time t n denoted 
Xt n , P with action A n . In this work, X tn . p is composed of the 
localisation and velocity components of the target p at time 
t n in the x-y plane: 

Xt n , P = [rx tniP ry tn:P vx tn . v ry tn , p ] T (31) 

where the subscript T stands for matrix transpose. The terms 
rxt n , p and ry tn , p refers here to the position and vx tn . p and 
vyt n , p the velocity of target p at time t n . We also denote D n p 
the random variable which takes values 1 if the radar produces 
a detection (and therefore an estimation) for target p and else 



A n = [e n S n ] ' 



(29) 



D n = [£>„ 



D r . 



(32) 



As said previously, this probability also depends on the time 
of observation S n . Aerial targets being considering here, the 
reflectivity of a target can be modelled using a Swerling I 
model [20]. We then have the following relation between the 
probability of detection and the probability of false alarm Pf a 
(i.e. the probability that the radar produce a detection knowing 
that there is no target) ( [5], [21]): 



l(y n ,p,xt n , P ,A n ) = r v^ 7 n,ph^v)^ Tr ,^ j (43) 



and 



1 - P d {x tn ,p,A n ) 



Kdy n ,p) = A 

cont (dy n , P ) + ^disc(dy 71, p ) 



(44) 



The relation between the state and the raw observations is 
given by : 



(33) 



Pd {%t n ,p > A n ) — Pfa 

where p(x tnyP , A n ) is the target signal-to-noise ratio. In the 
case of an ESA radar, it is equal to : 



p(xt niP ,A n ) 



a8 n 



t„,p 



(34) 



where r tntP is the target range and Pt n ,p the azimuth 
associated to target p at instant time i„. a is a coefficient 
which includes all the parameters of the sensor and B is the 
beamwidth of the radar. It is reminded in Appendix A how 
the equations [33] and [34] may be derived. If we make the 
assumption that all the detections are independant, we can 
write : 



¥(D n = d n \X tn =x tn ,A n ) = 

p 

P(-Dn,p = dn t p\X tni p = Xt niP , A n ) 



(35) 



where 



dn,p \-^t n ,p 



*&t n ,p j An) 



Pd{xt n , P ,A n )5 dn p=1 + (1 - Pd(xt n ,p,A n ))S dniP =o 



(36) 



Observation equation: At instant time t n , the radar 
produces a raw observation Y n composed of P measurements 



Yu,p\ 



(37) 



where Y n ,p is the observation related to target of state value 
Xt ntP obtained with action A n (we do not consider here the 
problem of measurement-target association). Moveover, we 
assume that the number of targets P is known. Each of these 
measurements has the following formulation : 

• 1 T 

n,p Hri,p ^*n,pj 



Y, 



L n,p 

where r n>p , n .„, r, 



\^n,p Pn,p ^*n,pj (38) 

p , / n>p are range, azimuth and range rate. The 



equation observation can be written 

F{Y n edy n \X tn =x tn ,A n )= (39) 

p 

JJP(y n , P e dy n jX tntP = x tn ,p,A n ) (40) 



where 

H Y n, P € dy n<p \X tn<p 



x tn ,p,A n ) (41) 
g{y n ,p, A n )\{dy n:P )A2) 



with ht n (xt ntP ) equals to: 
/ \f(rxt^p - rxfj) 2 + (ry tniP - ryfjf 



(45) 



r Vt r , 



- r Vt„ 



\ 



{rx t 



at an 



~ rx t „ S )( vx t n , P -vx° s ) + (ry tn}P -ry° s )(vyt n , P -vy° B ) 



(46) 



7 



\/( rx tn,p- rx 'i'? l a ) 2 +( r ytn,p- r y? 1 ^) 2 

and W n>p a gaussian noise the covariance matrix of which is 
given by : 



T, y = diag(a^,a^,a^.) 



(47) 



State equation: First let us introduce the definition of the 
unknown state X t at time t and its evolution through time. 
X tjP is the state of the target p. It has been defined above. Let 
P be the known number of targets in the space under analysis 
at time t. Xt has the following form: . 

■ X t ,p] T (48) 



X t = [X t ,i 



Based on [22] works, we classically assume that all the 
targets follow a nearly constant velocity model. We use a 
discretized version of this model ( [23]) : 



X t , p = F{X t ^ p , U t ) where U t ~ M (0, a 2 Q) 
where 



F 



"1 





p 








1 





p 








1 














1 



and Q 





£. 

3 





ill 

2 











2 





(49) 



(50) 



V. Simulations 

VI. Conclusion 
Appendix A 

We show in this Appendix how the probability of detection 
is derived. First, the radar transmits a pulse expressed as 
follows 



s(t) — a(t)cos(w c t) 
= Re{a(£)e^*} 



(51) 
(52) 



where a(t) is the envelope also called the transmitted pulse 
and w c the carrier frequency. This pulse is modified by the 
process of reflection. A target is modelled as a set of elemen- 
tary reflectors, each reflecting: time delayed, Doppler shift, 



Phase shift and attenuated version of the transmitted signal. 
We usually assume that the reflection process is linear and 
frequency independent within the bandwidth of the transmitted 
pulse. The return signal has the following formulation: 

s r (t) = GY J ^t-n)g l e ](wAt - T ^ t)+e ' ) + n(t) (53) 

i 

where 

• gi is the radar cross section associated to reflector i, 

• 8i is the phase shift associated to reflector i, 

• fi is the radial velocity between the antenna and the 
object (Doppler frequency shift), 

• G: others losses heavily range dependent due to spatial 
spreading of energy, 

• n(t) is a thermal noise of the receiver such that 
Re{n(i)},Im{n(t)} ~ AA(0,<7 2 ). 

We make the following approximations: 



r-j w r 

a(t — n) w a(t — t) 



(54) 



where r is the mean radial velocity of the target r is the mean 
time delay of the target. Using these approximations, the return 
signal can be rewritten as follows: 



where 



.(t)=a(t-T)Ge? WDt b + n{t) 



(55) 



(56) 



The fluctuations of b are known and modelled using Swerling 
1 model [20]. There are differents models availables (Swerling 
1, 2, 3,...) corresponding to different types of targets. Swerling 
1 given below is convenient for aircrafts. We can then write : 



Re{6},Im{6}~^(0,4 cs ) 



(57) 



This modelling of b assumes that the phase shifts 9i are 
independent and uniformly distributed and the magnitudes 
gi are identically distributed. If the number of reflector is 
large, the central limit theorem gives that b is a complex- 
valued Gaussian random variable centered at the origin. Now, 
a matching filter is applied to our return signal 



/+oo 
s r (t)h(s)ds 
-OO 



(58) 



where h(t) is a shifted, scaled and reversed copy of s r (t) 

h(s) = a(6-t)e- jWD ^-^ . (59) 

We choose t = S + t which yields the best signal to noise ratio 
where 6 is the length of the transmitted pulse. The probability 
of detection is based on quantity \s m (S + r)| 2 . We can show 
that 

/ + OC 
n(S + T- s)h(s)ds . (60) 
- OO 



One can remark that s m (S + r) is the sum of two complex 
value Gaussian variables. We look at the following statistic 

|s m (<5 + r)| 2 



A = 



2al 



and we introduce the following notation 

a s = G PRCS 

Now we construct the test 

{Hi : data generated by signal + noise 
Ho ■ data generated by noise 



Hi : pa(x) = -^—e 



(61) 



(62) 



(63) 



(64) 



H ■■ Pa(x) = e x 
Then, we derive the probability of detection and false alarm. 



Pd = J 7 +OC Pa(x\Hi is true) = e 



Pfa = L +OC Pa{x\H is true ) = e" 



(65) 



Consequently 



— 



(66) 



The ratio ^f- is called the Signal-to-Noise Ration noted p. This 
SNR is related to the parameters of the system and the target. 
The classical radar equation is given by the following formula 
([21]): 



P t G t G r X 2 a 



(67) 



f (47r) 3 r 4 

where P t is the energy of the transmitted pulse, G t is 
the gain of the transmitted antenna, G r is the gain of the 
received antenna, a is the radar cross section (for an aircraft 
between 0.1 and 1 m 2 ), r is the target range, 7 is the system 
noise temperature and L is a general loss term. However, 
the above formula does not take into account for the sake 
of simplicity the losses due to atmospheric attenuation and to 
the imperfection of the radar. Thus , extra terms must be added 

PtG t G r X 2 a 
9 (4^)3fc6L 7 r 4 ; 

where b is the receiver noise bandwith (generally consider 
equal to the signal bandwidth so that b = ■%-), k is Boltzmann's 
constant, 7 is the temperature of the system and L some losses. 
Moreover, the gain reduces with the deviation of the beam 
from the antenna normal in an array antenna. 



G t =Gocos a {d t ), 
G r =G cos a {6 t ) 



(69) 
(70) 



where Go is the gain of the antenna. In [24], a = 2, in [21], 
a = 2.7. According [25], there is also a beam loss because the 



radar beam is not pointing directly so that the radar equation 
is: 



P = 



P t Gl\ 2 a8 t cos 2 



t) 



-PtY 



(4ir) 3 kL'yr 4 
where is B is the beamwidth. 



(71) 
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